Overview

Dataset statistics

Number of variables22
Number of observations8708
Missing cells29094
Missing cells (%)15.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 MiB
Average record size in memory181.2 B

Variable types

NUM15
CAT7

Reproduction

Analysis started2020-08-14 07:00:03.137025
Analysis finished2020-08-14 07:00:47.668685
Duration44.53 seconds
Software versionpandas-profiling v2.9.0rc1
Download configurationconfig.yaml

Warnings

name has a high cardinality: 8574 distinct values High cardinality
neighbourhood has a high cardinality: 112 distinct values High cardinality
amenities has a high cardinality: 8045 distinct values High cardinality
monthly_price is highly correlated with weekly_priceHigh correlation
weekly_price is highly correlated with monthly_priceHigh correlation
square_feet has 8671 (99.6%) missing values Missing
weekly_price has 7854 (90.2%) missing values Missing
monthly_price has 7972 (91.5%) missing values Missing
security_deposit has 2845 (32.7%) missing values Missing
cleaning_fee has 1632 (18.7%) missing values Missing
bathrooms is highly skewed (γ1 = 20.8395) Skewed
maximum_nights is highly skewed (γ1 = 93.0233) Skewed
name is uniformly distributed Uniform
amenities is uniformly distributed Uniform
id has unique values Unique
bedrooms has 841 (9.7%) zeros Zeros
beds has 249 (2.9%) zeros Zeros
security_deposit has 2356 (27.1%) zeros Zeros
cleaning_fee has 424 (4.9%) zeros Zeros
extra_people has 4874 (56.0%) zeros Zeros

Variables

id
Real number (ℝ≥0)

UNIQUE

Distinct count8708
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.24806e+07
Minimum3663
Maximum4.3835e+07
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum3663
5-th percentile2.66113e+06
Q11.35745e+07
median2.06516e+07
Q33.37451e+07
95-th percentile4.25411e+07
Maximum4.3835e+07
Range4.38313e+07
Interquartile range (IQR)2.01707e+07

Descriptive statistics

Standard deviation1.24425e+07
Coefficient of variation (CV)0.553479
Kurtosis-1.0909
Mean2.24806e+07
Median Absolute Deviation (MAD)1.00897e+07
Skewness0.108562
Sum1.95761e+11
Variance1.54817e+14
MonotocityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1.61239e+071< 0.1%
 
2.80117e+071< 0.1%
 
4.35693e+071< 0.1%
 
1.35016e+071< 0.1%
 
2.19324e+071< 0.1%
 
3555271< 0.1%
 
2.20008e+071< 0.1%
 
7221141< 0.1%
 
3.39475e+061< 0.1%
 
2.85449e+071< 0.1%
 
Other values (8698)869899.9%
 
ValueCountFrequency (%) 
36631< 0.1%
 
36701< 0.1%
 
36861< 0.1%
 
37711< 0.1%
 
39431< 0.1%
 
ValueCountFrequency (%) 
4.3835e+071< 0.1%
 
4.38346e+071< 0.1%
 
4.38345e+071< 0.1%
 
4.382e+071< 0.1%
 
4.38197e+071< 0.1%
 

name
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count8574
Unique (%)98.5%
Missing2
Missing (%)< 0.1%
Memory size68.0 KiB
NW DC 30 Day Stays One Bedroom
 
10
Spacious Queen Room in Columbia Heights
 
9
Spacious Queen Room in Shaw
 
8
Sojourn the 13th Street Flats
 
6
Shaw Convention Center Apartments 30 Day Stays Two Bedroom
 
5
Other values (8569)
8668 
ValueCountFrequency (%) 
NW DC 30 Day Stays One Bedroom100.1%
 
Spacious Queen Room in Columbia Heights90.1%
 
Spacious Queen Room in Shaw80.1%
 
Sojourn the 13th Street Flats60.1%
 
Shaw Convention Center Apartments 30 Day Stays Two Bedroom50.1%
 
Walter Convention Center Apartments 30 Day Stays Two Bedroom50.1%
 
Very Comfy & Cute Luxe Twin Bed in Shared Room 150.1%
 
Spacious Queen Room in Capitol Hill50.1%
 
Lux 1 Bedroom near White House w/wifi4< 0.1%
 
Spacious Full Room in Columbia Heights4< 0.1%
 
Other values (8564)864599.3%
 

Length

Max length98
Median length42
Mean length39.6918
Min length1

neighbourhood
Categorical

HIGH CARDINALITY

Distinct count112
Unique (%)1.3%
Missing0
Missing (%)0.0%
Memory size14.4 KiB
Capitol Hill
1020 
Dupont Circle
 
488
Columbia Heights
 
432
Near Northeast/H Street Corridor
 
393
Logan Circle
 
388
Other values (107)
5987 
ValueCountFrequency (%) 
Capitol Hill102011.7%
 
Dupont Circle4885.6%
 
Columbia Heights4325.0%
 
Near Northeast/H Street Corridor3934.5%
 
Logan Circle3884.5%
 
Shaw3754.3%
 
U Street Corridor3403.9%
 
Petworth2973.4%
 
Adams Morgan2653.0%
 
Foggy Bottom2242.6%
 
Other values (102)448651.5%
 

Length

Max length35
Median length12
Mean length13.1548
Min length4
Distinct count39
Unique (%)0.4%
Missing0
Missing (%)0.0%
Memory size10.1 KiB
Union Station, Stanton Park, Kingman Park
835 
Columbia Heights, Mt. Pleasant, Pleasant Plains, Park View
821 
Capitol Hill, Lincoln Park
808 
Dupont Circle, Connecticut Avenue/K Street
679 
Edgewood, Bloomingdale, Truxton Circle, Eckington
636 
Other values (34)
4929 
ValueCountFrequency (%) 
Union Station, Stanton Park, Kingman Park8359.6%
 
Columbia Heights, Mt. Pleasant, Pleasant Plains, Park View8219.4%
 
Capitol Hill, Lincoln Park8089.3%
 
Dupont Circle, Connecticut Avenue/K Street6797.8%
 
Edgewood, Bloomingdale, Truxton Circle, Eckington6367.3%
 
Shaw, Logan Circle5976.9%
 
Brightwood Park, Crestwood, Petworth4855.6%
 
Kalorama Heights, Adams Morgan, Lanier Heights4004.6%
 
Downtown, Chinatown, Penn Quarters, Mount Vernon Square, North Capitol Street3814.4%
 
Howard University, Le Droit Park, Cardozo/Shaw3363.9%
 
Other values (29)273031.4%
 

Length

Max length97
Median length42
Mean length43.613
Min length18

zipcode
Categorical

Distinct count49
Unique (%)0.6%
Missing67
Missing (%)0.8%
Memory size10.1 KiB
20002
1468 
20009
1239 
20001
1142 
20003
763 
20011
649 
Other values (44)
3380 
ValueCountFrequency (%) 
20002146816.9%
 
20009123914.2%
 
20001114213.1%
 
200037638.8%
 
200116497.5%
 
200105165.9%
 
200073904.5%
 
200053033.5%
 
200192593.0%
 
200202442.8%
 
Other values (39)166819.2%
 

Length

Max length10
Median length5
Mean length5.02871
Min length1

property_type
Categorical

Distinct count22
Unique (%)0.3%
Missing0
Missing (%)0.0%
Memory size9.3 KiB
Apartment
3767 
House
1793 
Townhouse
1300 
Condominium
739 
Guest suite
503 
Other values (17)
606 
ValueCountFrequency (%) 
Apartment376743.3%
 
House179320.6%
 
Townhouse130014.9%
 
Condominium7398.5%
 
Guest suite5035.8%
 
Serviced apartment3103.6%
 
Boutique hotel590.7%
 
Guesthouse550.6%
 
Loft540.6%
 
Bed and breakfast370.4%
 
Other values (12)911.0%
 

Length

Max length18
Median length9
Mean length8.79536
Min length4

room_type
Categorical

Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
Entire home/apt
6159 
Private room
2306 
Shared room
 
199
Hotel room
 
44
ValueCountFrequency (%) 
Entire home/apt615970.7%
 
Private room230626.5%
 
Shared room1992.3%
 
Hotel room440.5%
 

Length

Max length15
Median length15
Mean length14.0889
Min length10

accommodates
Real number (ℝ≥0)

Distinct count16
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.60622
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile8
Maximum16
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.35845
Coefficient of variation (CV)0.653994
Kurtosis5.93212
Mean3.60622
Median Absolute Deviation (MAD)1
Skewness1.99351
Sum31403
Variance5.56228
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
2308535.4%
 
4188521.6%
 
395210.9%
 
68199.4%
 
17698.8%
 
54795.5%
 
83063.5%
 
71211.4%
 
10931.1%
 
12560.6%
 
Other values (6)1431.6%
 
ValueCountFrequency (%) 
17698.8%
 
2308535.4%
 
395210.9%
 
4188521.6%
 
54795.5%
 
ValueCountFrequency (%) 
16530.6%
 
1590.1%
 
14180.2%
 
13100.1%
 
12560.6%
 

bathrooms
Real number (ℝ≥0)

SKEWED

Distinct count21
Unique (%)0.2%
Missing8
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.35494
Minimum0
Maximum50
Zeros17
Zeros (%)0.2%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q31.5
95-th percentile2.5
Maximum50
Range50
Interquartile range (IQR)0.5

Descriptive statistics

Standard deviation0.887767
Coefficient of variation (CV)0.655206
Kurtosis1046.44
Mean1.35494
Median Absolute Deviation (MAD)0
Skewness20.8395
Sum11788
Variance0.788129
MonotocityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%) 
1614470.6%
 
292210.6%
 
1.57338.4%
 
2.55025.8%
 
3.51491.7%
 
31331.5%
 
4.5320.4%
 
4310.4%
 
0170.2%
 
0.570.1%
 
Other values (11)300.3%
 
(Missing)80.1%
 
ValueCountFrequency (%) 
0170.2%
 
0.570.1%
 
1614470.6%
 
1.57338.4%
 
292210.6%
 
ValueCountFrequency (%) 
501< 0.1%
 
112< 0.1%
 
10.51< 0.1%
 
103< 0.1%
 
91< 0.1%
 

bedrooms
Real number (ℝ≥0)

ZEROS

Distinct count13
Unique (%)0.1%
Missing8
Missing (%)0.1%
Infinite0
Infinite (%)0.0%
Mean1.36908
Minimum0
Maximum27
Zeros841
Zeros (%)9.7%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median1
Q32
95-th percentile3
Maximum27
Range27
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.00521
Coefficient of variation (CV)0.734223
Kurtosis54.282
Mean1.36908
Median Absolute Deviation (MAD)0
Skewness3.59388
Sum11911
Variance1.01045
MonotocityNot monotonic
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%) 
1527160.5%
 
2162718.7%
 
08419.7%
 
36507.5%
 
42092.4%
 
5660.8%
 
6180.2%
 
790.1%
 
93< 0.1%
 
83< 0.1%
 
Other values (3)3< 0.1%
 
(Missing)80.1%
 
ValueCountFrequency (%) 
08419.7%
 
1527160.5%
 
2162718.7%
 
36507.5%
 
42092.4%
 
ValueCountFrequency (%) 
271< 0.1%
 
111< 0.1%
 
101< 0.1%
 
93< 0.1%
 
83< 0.1%
 

beds
Real number (ℝ≥0)

ZEROS

Distinct count21
Unique (%)0.2%
Missing35
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean1.88724
Minimum0
Maximum51
Zeros249
Zeros (%)2.9%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum51
Range51
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.6558
Coefficient of variation (CV)0.877366
Kurtosis183.064
Mean1.88724
Median Absolute Deviation (MAD)1
Skewness8.1993
Sum16368
Variance2.74166
MonotocityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%) 
1428049.2%
 
2236227.1%
 
394610.9%
 
44144.8%
 
02492.9%
 
51802.1%
 
61111.3%
 
7380.4%
 
8340.4%
 
9140.2%
 
Other values (11)450.5%
 
(Missing)350.4%
 
ValueCountFrequency (%) 
02492.9%
 
1428049.2%
 
2236227.1%
 
394610.9%
 
44144.8%
 
ValueCountFrequency (%) 
511< 0.1%
 
501< 0.1%
 
191< 0.1%
 
181< 0.1%
 
171< 0.1%
 

amenities
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count8045
Unique (%)92.4%
Missing0
Missing (%)0.0%
Memory size399.9 KiB
{TV,"Cable TV",Wifi,"Air conditioning",Kitchen,"Pets allowed",Gym,Elevator,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"BBQ grill","Patio or balcony","Paid parking on premises"}
 
45
{TV,"Cable TV",Wifi,"Air conditioning",Pool,Kitchen,"Pets allowed",Gym,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets","Ethernet connection",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove}
 
29
{TV,"Cable TV",Wifi,"Air conditioning",Pool,Kitchen,"Pets allowed",Gym,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets","Ethernet connection",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Shower gel","Trash can"}
 
28
{Wifi,"Air conditioning",Kitchen,Breakfast,Heating,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Self check-in",Keypad,"Room-darkening shades","Bed linens",Microwave,Refrigerator,"Dishes and silverware","Cooking basics",Oven,Stove,"Long term stays allowed"}
 
19
{TV,Wifi,"Air conditioning",Kitchen,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,"Laptop-friendly workspace",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Long term stays allowed"}
 
18
Other values (8040)
8569 
ValueCountFrequency (%) 
{TV,"Cable TV",Wifi,"Air conditioning",Kitchen,"Pets allowed",Gym,Elevator,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"BBQ grill","Patio or balcony","Paid parking on premises"}450.5%
 
{TV,"Cable TV",Wifi,"Air conditioning",Pool,Kitchen,"Pets allowed",Gym,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets","Ethernet connection",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove}290.3%
 
{TV,"Cable TV",Wifi,"Air conditioning",Pool,Kitchen,"Pets allowed",Gym,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Hot water","Bed linens","Extra pillows and blankets","Ethernet connection",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Shower gel","Trash can"}280.3%
 
{Wifi,"Air conditioning",Kitchen,Breakfast,Heating,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Self check-in",Keypad,"Room-darkening shades","Bed linens",Microwave,Refrigerator,"Dishes and silverware","Cooking basics",Oven,Stove,"Long term stays allowed"}190.2%
 
{TV,Wifi,"Air conditioning",Kitchen,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,"Laptop-friendly workspace",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Long term stays allowed"}180.2%
 
{}150.2%
 
{TV,Wifi,Kitchen,"Pets allowed",Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance","Ethernet connection"}150.2%
 
{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Pets allowed",Gym,Elevator,Heating,"Family/kid friendly",Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace",Bathtub,"Hot water","Bed linens",Refrigerator,Oven,Stove,"Long term stays allowed"}130.1%
 
{TV,"Cable TV",Wifi,"Air conditioning",Kitchen,"Pets allowed",Gym,Elevator,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Self check-in",Lockbox,"Private entrance","Bed linens",Microwave,"Coffee maker",Refrigerator,Dishwasher,Oven,"Patio or balcony","Long term stays allowed"}130.1%
 
{Wifi,"Air conditioning",Kitchen,Breakfast,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door","Hair dryer",Iron}110.1%
 
Other values (8035)850297.6%
 

Length

Max length1386
Median length363
Mean length377.223
Min length2

square_feet
Real number (ℝ≥0)

MISSING

Distinct count25
Unique (%)67.6%
Missing8671
Missing (%)99.6%
Infinite0
Infinite (%)0.0%
Mean1185.95
Minimum0
Maximum5300
Zeros1
Zeros (%)< 0.1%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile228
Q1575
median1000
Q31400
95-th percentile3200
Maximum5300
Range5300
Interquartile range (IQR)825

Descriptive statistics

Standard deviation1040
Coefficient of variation (CV)0.876934
Kurtosis7.066
Mean1185.95
Median Absolute Deviation (MAD)425
Skewness2.43383
Sum43880
Variance1.08159e+06
MonotocityNot monotonic
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%) 
12004< 0.1%
 
10003< 0.1%
 
5003< 0.1%
 
11002< 0.1%
 
6002< 0.1%
 
15002< 0.1%
 
4502< 0.1%
 
16002< 0.1%
 
01< 0.1%
 
5051< 0.1%
 
Other values (15)150.2%
 
(Missing)867199.6%
 
ValueCountFrequency (%) 
01< 0.1%
 
1401< 0.1%
 
2501< 0.1%
 
4502< 0.1%
 
5003< 0.1%
 
ValueCountFrequency (%) 
53001< 0.1%
 
40001< 0.1%
 
30001< 0.1%
 
24001< 0.1%
 
17001< 0.1%
 

price
Real number (ℝ≥0)

Distinct count465
Unique (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean199.031
Minimum0
Maximum10000
Zeros2
Zeros (%)< 0.1%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile40
Q177
median110
Q3175
95-th percentile549
Maximum10000
Range10000
Interquartile range (IQR)98

Descriptive statistics

Standard deviation475
Coefficient of variation (CV)2.38656
Kurtosis249.67
Mean199.031
Median Absolute Deviation (MAD)40
Skewness13.9135
Sum1.73317e+06
Variance225625
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1003824.4%
 
1503053.5%
 
752593.0%
 
802232.6%
 
902152.5%
 
1252142.5%
 
852062.4%
 
501902.2%
 
1201872.1%
 
991852.1%
 
Other values (455)634272.8%
 
ValueCountFrequency (%) 
02< 0.1%
 
1080.1%
 
132< 0.1%
 
141< 0.1%
 
1570.1%
 
ValueCountFrequency (%) 
1000090.1%
 
99991< 0.1%
 
700070.1%
 
59951< 0.1%
 
55001< 0.1%
 

weekly_price
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count254
Unique (%)29.7%
Missing7854
Missing (%)90.2%
Infinite0
Infinite (%)0.0%
Mean837.029
Minimum99
Maximum10000
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum99
5-th percentile300
Q1500
median693
Q3970.75
95-th percentile2000
Maximum10000
Range9901
Interquartile range (IQR)470.75

Descriptive statistics

Standard deviation656.987
Coefficient of variation (CV)0.784903
Kurtosis52.6779
Mean837.029
Median Absolute Deviation (MAD)228
Skewness5.26031
Sum714823
Variance431631
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
500450.5%
 
600450.5%
 
700300.3%
 
1000290.3%
 
400290.3%
 
750270.3%
 
800240.3%
 
900240.3%
 
450230.3%
 
650190.2%
 
Other values (244)5596.4%
 
(Missing)785490.2%
 
ValueCountFrequency (%) 
991< 0.1%
 
1192< 0.1%
 
1501< 0.1%
 
1751< 0.1%
 
1791< 0.1%
 
ValueCountFrequency (%) 
100001< 0.1%
 
60001< 0.1%
 
49001< 0.1%
 
42002< 0.1%
 
41651< 0.1%
 

monthly_price
Real number (ℝ≥0)

HIGH CORRELATION
MISSING

Distinct count240
Unique (%)32.6%
Missing7972
Missing (%)91.5%
Infinite0
Infinite (%)0.0%
Mean2778.75
Minimum400
Maximum30000
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum400
5-th percentile900
Q11550
median2300
Q33399.25
95-th percentile6000
Maximum30000
Range29600
Interquartile range (IQR)1849.25

Descriptive statistics

Standard deviation2069.33
Coefficient of variation (CV)0.744696
Kurtosis45.5266
Mean2778.75
Median Absolute Deviation (MAD)800
Skewness4.72402
Sum2.04516e+06
Variance4.28211e+06
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
3000250.3%
 
1500250.3%
 
2500230.3%
 
3500200.2%
 
1800190.2%
 
2200180.2%
 
4000160.2%
 
1200150.2%
 
2800140.2%
 
2000140.2%
 
Other values (230)5476.3%
 
(Missing)797291.5%
 
ValueCountFrequency (%) 
4001< 0.1%
 
4202< 0.1%
 
4831< 0.1%
 
6301< 0.1%
 
6851< 0.1%
 
ValueCountFrequency (%) 
300001< 0.1%
 
150002< 0.1%
 
138501< 0.1%
 
120002< 0.1%
 
100002< 0.1%
 

security_deposit
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count92
Unique (%)1.6%
Missing2845
Missing (%)32.7%
Infinite0
Infinite (%)0.0%
Mean246.906
Minimum0
Maximum5000
Zeros2356
Zeros (%)27.1%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median100
Q3300
95-th percentile1000
Maximum5000
Range5000
Interquartile range (IQR)300

Descriptive statistics

Standard deviation444.061
Coefficient of variation (CV)1.7985
Kurtosis46.5551
Mean246.906
Median Absolute Deviation (MAD)100
Skewness5.5569
Sum1.44761e+06
Variance197190
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0235627.1%
 
5007678.8%
 
1006147.1%
 
2504134.7%
 
2003894.5%
 
1502943.4%
 
3002683.1%
 
10001882.2%
 
4001001.1%
 
1500650.7%
 
Other values (82)4094.7%
 
(Missing)284532.7%
 
ValueCountFrequency (%) 
0235627.1%
 
95210.2%
 
9970.1%
 
1006147.1%
 
1051< 0.1%
 
ValueCountFrequency (%) 
5000160.2%
 
48001< 0.1%
 
45002< 0.1%
 
43001< 0.1%
 
38001< 0.1%
 

cleaning_fee
Real number (ℝ≥0)

MISSING
ZEROS

Distinct count167
Unique (%)2.4%
Missing1632
Missing (%)18.7%
Infinite0
Infinite (%)0.0%
Mean82.7114
Minimum0
Maximum605
Zeros424
Zeros (%)4.9%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q130
median60
Q3100
95-th percentile250
Maximum605
Range605
Interquartile range (IQR)70

Descriptive statistics

Standard deviation78.4482
Coefficient of variation (CV)0.948456
Kurtosis5.47009
Mean82.7114
Median Absolute Deviation (MAD)35
Skewness2.06756
Sum585266
Variance6154.11
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
506537.5%
 
1004625.3%
 
754314.9%
 
04244.9%
 
253373.9%
 
203303.8%
 
1502613.0%
 
302472.8%
 
602332.7%
 
802332.7%
 
Other values (157)346539.8%
 
(Missing)163218.7%
 
ValueCountFrequency (%) 
04244.9%
 
5370.4%
 
690.1%
 
760.1%
 
880.1%
 
ValueCountFrequency (%) 
6051< 0.1%
 
6002< 0.1%
 
5502< 0.1%
 
5204< 0.1%
 
5003< 0.1%
 

guests_included
Real number (ℝ≥0)

Distinct count15
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.78594
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile5
Maximum16
Range15
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.48098
Coefficient of variation (CV)0.82924
Kurtosis11.4306
Mean1.78594
Median Absolute Deviation (MAD)0
Skewness2.85966
Sum15552
Variance2.19329
MonotocityNot monotonic
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%) 
1559564.3%
 
2169119.4%
 
47128.2%
 
32643.0%
 
61942.2%
 
51111.3%
 
8620.7%
 
7370.4%
 
10250.3%
 
1280.1%
 
Other values (5)90.1%
 
ValueCountFrequency (%) 
1559564.3%
 
2169119.4%
 
32643.0%
 
47128.2%
 
51111.3%
 
ValueCountFrequency (%) 
163< 0.1%
 
151< 0.1%
 
141< 0.1%
 
1280.1%
 
111< 0.1%
 

extra_people
Real number (ℝ≥0)

ZEROS

Distinct count64
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.3699
Minimum0
Maximum300
Zeros4874
Zeros (%)56.0%
Memory size68.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q320
95-th percentile50
Maximum300
Range300
Interquartile range (IQR)20

Descriptive statistics

Standard deviation24.2877
Coefficient of variation (CV)1.96345
Kurtosis54.0673
Mean12.3699
Median Absolute Deviation (MAD)0
Skewness5.79491
Sum107717
Variance589.893
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0487456.0%
 
106747.7%
 
256537.5%
 
206026.9%
 
155075.8%
 
503173.6%
 
302132.4%
 
351481.7%
 
51351.6%
 
401021.2%
 
Other values (54)4835.5%
 
ValueCountFrequency (%) 
0487456.0%
 
51351.6%
 
680.1%
 
7110.1%
 
8240.3%
 
ValueCountFrequency (%) 
300190.2%
 
25060.1%
 
2251< 0.1%
 
20070.1%
 
1791< 0.1%
 

minimum_nights
Real number (ℝ≥0)

Distinct count61
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.42708
Minimum1
Maximum600
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q33
95-th percentile30
Maximum600
Range599
Interquartile range (IQR)2

Descriptive statistics

Standard deviation23.6425
Coefficient of variation (CV)3.18328
Kurtosis186.356
Mean7.42708
Median Absolute Deviation (MAD)1
Skewness11.3438
Sum64675
Variance558.966
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
2298434.3%
 
1245828.2%
 
3131615.1%
 
306567.5%
 
43243.7%
 
52232.6%
 
71802.1%
 
14790.9%
 
10590.7%
 
6580.7%
 
Other values (51)3714.3%
 
ValueCountFrequency (%) 
1245828.2%
 
2298434.3%
 
3131615.1%
 
43243.7%
 
52232.6%
 
ValueCountFrequency (%) 
6001< 0.1%
 
5552< 0.1%
 
3654< 0.1%
 
3641< 0.1%
 
3603< 0.1%
 

maximum_nights
Real number (ℝ≥0)

SKEWED

Distinct count151
Unique (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean258741
Minimum1
Maximum2.14748e+09
Zeros0
Zeros (%)0.0%
Memory size68.0 KiB

Quantile statistics

Minimum1
5-th percentile5
Q130
median1125
Q31125
95-th percentile1125
Maximum2.14748e+09
Range2.14748e+09
Interquartile range (IQR)1095

Descriptive statistics

Standard deviation2.30377e+07
Coefficient of variation (CV)89.0374
Kurtosis8670.38
Mean258741
Median Absolute Deviation (MAD)0
Skewness93.0233
Sum2.25312e+09
Variance5.30734e+14
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1125442050.8%
 
304865.6%
 
3653744.3%
 
73193.7%
 
142382.7%
 
902342.7%
 
51691.9%
 
601601.8%
 
281571.8%
 
31561.8%
 
Other values (141)199522.9%
 
ValueCountFrequency (%) 
170.1%
 
2430.5%
 
31561.8%
 
41311.5%
 
51691.9%
 
ValueCountFrequency (%) 
2.14748e+091< 0.1%
 
1e+081< 0.1%
 
999991< 0.1%
 
100002< 0.1%
 
33652< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

idnameneighbourhoodneighbourhood_cleansedzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsamenitiessquare_feetpriceweekly_pricemonthly_pricesecurity_depositcleaning_feeguests_includedextra_peopleminimum_nightsmaximum_nights
03663Classic Rowhouse: Porch+ART+ParkingManor ParkBrightwood Park, Crestwood, Petworth20011TownhouseEntire home/apt43.54.02.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Free parking on premises","Free street parking",Heating,"Family/kid friendly",Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"24-hour check-in",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Self check-in",Lockbox,Bathtub,"Children’s books and toys","Pack ’n Play/travel crib","Children’s dinnerware","Hot water","Luggage dropoff allowed","Long term stays allowed"}NaN154.0NaNNaN0.050.040.0330
13670Beautiful Sun-Lit U Street 1BR/1BAU Street CorridorHoward University, Le Droit Park, Cardozo/Shaw20009TownhousePrivate room21.01.01.0{Wifi,"Air conditioning","Pets live on this property",Dog(s),Heating,"Family/kid friendly","Smoke alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer","translation missing: en.hosting_amenity_50","Hot water","Extra pillows and blankets","Host greets you"}NaN75.0600.0NaN500.025.010.0230
23686Vita's HideawayAnacostiaHistoric Anacostia20020HousePrivate room11.01.01.0{Internet,Wifi,Kitchen,"Free street parking","Indoor fireplace",Heating,"Family/kid friendly",Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit",Essentials,Shampoo,Hangers,"Hot water","Bed linens","Extra pillows and blankets",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Patio or balcony","Garden or backyard"}NaN55.0430.0975.00.00.010.02365
33771Mt. PleasantMount PleasantColumbia Heights, Mt. Pleasant, Pleasant Plains, Park View20009OtherPrivate room21.01.01.0{"Cable TV","Air conditioning",Heating,"Smoke alarm","Carbon monoxide alarm","First aid kit","Safety card","Fire extinguisher"}NaN88.0NaNNaNNaNNaN10.011125
43943Historic Rowhouse Near MonumentsEckingtonEdgewood, Bloomingdale, Truxton Circle, Eckington20002TownhousePrivate room21.01.01.0{Internet,Wifi,"Air conditioning","Free parking on premises","Pets live on this property",Cat(s),Heating,Washer,Dryer,"Smoke alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","translation missing: en.hosting_amenity_49","translation missing: en.hosting_amenity_50","Self check-in",Keypad,"Hot water","Bed linens","Coffee maker"}NaN80.0NaN1800.0NaNNaN115.0271125
54197Bedroom in DC 2 blocks to MetroCapitol HillCapitol Hill, Lincoln Park20003HousePrivate room21.51.01.0{TV,Wifi,"Air conditioning",Kitchen,"Pets live on this property","Free street parking","Indoor fireplace",Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Safety card","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","translation missing: en.hosting_amenity_49","translation missing: en.hosting_amenity_50",Bathtub,"Fireplace guards","Hot water","Bed linens","Extra pillows and blankets",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Patio or balcony","Garden or backyard","Luggage dropoff allowed","Long term stays allowed","Host greets you","Shower gel","Baking sheet","Trash can"}NaN83.0NaNNaNNaN35.010.028365
64501DC RowhouseShawShaw, Logan Circle20001HousePrivate room21.52.02.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Free parking on premises",Breakfast,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","translation missing: en.hosting_amenity_49"}NaN150.0NaNNaN500.00.0250.03090
74967DC, Near MetroIvy CityIvy City, Arboretum, Trinidad, Carver Langston20002HousePrivate room13.01.01.0{"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Smoking allowed","Indoor fireplace",Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","Fire extinguisher","Lock on bedroom door","translation missing: en.hosting_amenity_49","translation missing: en.hosting_amenity_50"}NaN99.0600.01500.0NaNNaN10.02365
85589Cozy apt in Adams MorganAdams MorganKalorama Heights, Adams Morgan, Lanier Heights20009ApartmentEntire home/apt31.01.01.0{TV,Internet,Wifi,"Air conditioning",Kitchen,"Paid parking off premises","Free street parking","Indoor fireplace","Buzzer/wireless intercom",Heating,"Family/kid friendly",Washer,Dryer,"Smoke alarm","First aid kit","Safety card","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Self check-in",Lockbox,"Private entrance",Bathtub,"Window guards","Room-darkening shades","Hot water","Bed linens","Extra pillows and blankets","Ethernet connection",Microwave,"Coffee maker",Refrigerator,Dishwasher,"Dishes and silverware","Cooking basics",Oven,Stove,"Luggage dropoff allowed"}NaN118.0650.02500.0100.015.0110.0313
97103Best of Washington - Great neighborhood, parkingBerkleySpring Valley, Palisades, Wesley Heights, Foxhall Crescent, Foxhall Village, Georgetown Reservoir20007Guest suiteEntire home/apt21.01.02.0{TV,"Cable TV",Internet,Wifi,"Air conditioning",Kitchen,"Pets allowed","Free street parking",Heating,"Family/kid friendly",Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Safety card","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door","24-hour check-in",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private living room","Private entrance","Hot water","Bed linens","Extra pillows and blankets",Microwave,"Coffee maker",Refrigerator,"Dishes and silverware","BBQ grill","Garden or backyard","Luggage dropoff allowed","Long term stays allowed","Cleaning before checkout","Host greets you","Shower gel","Trash can"}NaN99.0506.01517.00.035.0115.03365

Last rows

idnameneighbourhoodneighbourhood_cleansedzipcodeproperty_typeroom_typeaccommodatesbathroomsbedroomsbedsamenitiessquare_feetpriceweekly_pricemonthly_pricesecurity_depositcleaning_feeguests_includedextra_peopleminimum_nightsmaximum_nights
869843818019Vibrant apartment in the heart of DCLogan CircleKalorama Heights, Adams Morgan, Lanier Heights20009ApartmentEntire home/apt51.02.02.0{TV,Wifi,"Air conditioning",Kitchen,Heating,Washer,Dryer,"Smoke alarm","First aid kit",Essentials,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace"}NaN119.0NaNNaN0.0150.010.01545
869943818065Cozy ☆ Well-Located Brookland Condo ☆ Paid ParkingLogan CircleBrookland, Brentwood, Langdon20017CondominiumEntire home/apt41.01.01.0{TV,Wifi,"Air conditioning","Free parking on premises",Heating,"Smoke alarm","Carbon monoxide alarm",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance"}NaN99.0NaNNaN300.070.0110.0214
870043818698Cozy ☆ Brookland Condo ☆ Sleeps 6 ☆ Paid ParkingLogan CircleBrookland, Brentwood, Langdon20017CondominiumEntire home/apt61.02.04.0{Wifi,"Air conditioning","Free parking on premises",Heating,"Smoke alarm","Carbon monoxide alarm",Essentials,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance"}NaN123.0NaNNaN400.0100.0210.0214
870143818983Cozy ☆ Capitol Hill 2 BR Condo ☆ On Penn AveLogan CircleCapitol Hill, Lincoln Park20003CondominiumEntire home/apt51.02.03.0{TV,Wifi,"Air conditioning",Kitchen,Heating,"Smoke alarm","Carbon monoxide alarm",Essentials,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance"}NaN129.0NaNNaN500.0100.0210.0230
870243819475Visually stunning contemporary condoLogan CircleCleveland Park, Woodley Park, Massachusetts Avenue Heights, Woodland-Normanstone Terrace20016LoftEntire home/apt41.52.02.0{TV,Wifi,"Air conditioning",Kitchen,"Free parking on premises",Breakfast,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Private entrance"}NaN500.0NaNNaNNaN100.02100.011125
870343819661Sojourn on Constitution Avenue 1 BedroomCapitol HillUnion Station, Stanton Park, Kingman Park20002HouseEntire home/apt21.01.01.0{TV,Wifi,"Air conditioning",Kitchen,Heating,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Self check-in",Keypad,"Private entrance"}NaN225.0NaNNaN0.0100.010.031125
870443819954Sojourn on Constitution Sleeps 5 | Outdoor SpaceLogan CircleCapitol Hill, Lincoln Park20002HouseEntire home/apt51.02.02.0{TV,Wifi,"Air conditioning",Kitchen,Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Hair dryer",Iron,"Private entrance"}NaN325.0NaNNaN0.0150.010.031125
8705438344952~BD 2~BA LUXURY CITY CENTER DC PENTHOUSEDowntown/Penn QuarterDowntown, Chinatown, Penn Quarters, Mount Vernon Square, North Capitol Street20268ApartmentEntire home/apt62.02.02.0{TV,Wifi,"Air conditioning",Pool,Kitchen,"Free parking on premises",Gym,Elevator,Heating,"Suitable for events",Washer,Dryer,"Smoke alarm","First aid kit",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Hot water"}NaN99.0NaNNaNNaN120.010.01365
870643834631Nice for a couple roomCapitol HillUnion Station, Stanton Park, Kingman Park20002HousePrivate room21.51.02.0{"Air conditioning",Kitchen,"Free parking on premises","Pets allowed",Breakfast,"Indoor fireplace",Heating,Washer,Dryer,"Smoke alarm","Carbon monoxide alarm","First aid kit","Fire extinguisher",Essentials,Shampoo,"Lock on bedroom door",Hangers,"Private living room"}NaN100.0NaNNaNNaNNaN10.0128
870743834997The Chic suite on 16th st NW 1BD~1BAColumbia HeightsKalorama Heights, Adams Morgan, Lanier Heights20009ApartmentEntire home/apt31.01.02.0{TV,Wifi,"Air conditioning",Kitchen,Gym,Heating,"Suitable for events",Washer,Dryer,"Smoke alarm","First aid kit",Essentials,Shampoo,Hangers,"Hair dryer",Iron,"Laptop-friendly workspace","Hot water"}NaN70.0NaNNaNNaN100.010.01365